Emergent Complexity via Multi-Agent Competition

نویسندگان

  • Trapit Bansal
  • Jakub W. Pachocki
  • Szymon Sidor
  • Ilya Sutskever
  • Igor Mordatch
چکیده

Reinforcement learning algorithms can train agents that solve problems in complex, interesting environments. Normally, the complexity of the trained agent is closely related to the complexity of the environment. This suggests that a highly capable agent requires a complex environment for training. In this paper, we point out that a competitive multi-agent environment trained with self-play can produce behaviors that are far more complex than the environment itself. We also point out that such environments come with a natural curriculum, because for any skill level, an environment full of agents of this level will have the right level of difficulty. This work introduces several competitive multi-agent environments where agents compete in a 3D world with simulated physics. The trained agents learn a wide variety of complex and interesting skills, even though the environment themselves are relatively simple. The skills include behaviors such as running, blocking, ducking, tackling, fooling opponents, kicking, and defending using both arms and legs. A highlight of the learned behaviors can be found here: https://goo.gl/eR7fbX.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-agent Model to Control Production System: A Reactive and Emergent Approach by Cooperation and Competition between Agents

In view of strong competitive markets, current companies tend to new methods of production, from a logic of «planning» type of production to a logical of «Just in time » type. In this context, the system that allows controlling the production has to be a modular, flexible and reactive system. The hierarchized and classical approaches don’t permit any more to take into account the complexity lin...

متن کامل

Dynamic configuration and collaborative scheduling in supply chains based on scalable multi-agent architecture

Due to diversified and frequently changing demands from customers, technological advances and global competition, manufacturers rely on collaboration with their business partners to share costs, risks and expertise. How to take advantage of advancement of technologies to effectively support operations and create competitive advantage is critical for manufacturers to survive. To respond to these...

متن کامل

An architecture for identifying emergent behavior in multi-agent systems

Multi-agent systems exhibit unexpected, emergent behavior as a result of the complexity of agent behaviors and their interactions. Despite significant research interest in the past decades, computational methods to identify and analyze emergence as it happens are still needed. This paper proposes a software architecture for identifying emergent behavior in a multi-agent system as it happens, us...

متن کامل

Distributed Simulation of Agent-based Models

There has been considerable recent interest in complex systems, which involve dynamic and unpredictable interactions between large numbers of components including software, hardware devices (such as sensors), and social entities (people or collective bodies). Examples of such systems include from traditional embedded systems, to systems controlling critical infrastructures, such as defence, ene...

متن کامل

A simulation approach to design contracts that govern emergent multi-agent systems

Agent-based normative systems offer the potential for a business to model, understand the consequences of, and then refine contracts to improve the outcomes for that business. In this paper, we combine a simulation technique designed for investigating and tuning emergent behavior in multi-agent systems with an approach to modeling norms of the complexity found in business contracts. We believe ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1710.03748  شماره 

صفحات  -

تاریخ انتشار 2017